Joint Maximum Margin and Maximum Entropy Learning of Graphical Models

نویسنده

Jun Zhu

چکیده

INFERRING structured predictions based on correlated covariates remains a central problem in many fields, including NLP, computer vision, and computational biology. Typically, both the input covariates and output predictions can be high-dimensional, multi-modal, noisy, partially observable, and bearing latent structures, each of these characteristics adds a degree of complexity to the task of learning structured input/output (I/O) models. Several recent approaches to structured I/O learning are based on learning discriminative probabilistic graphical models. By defining composite features that explicitly exploit the structured dependencies among input elements (e.g., words in a sentence) and among the interpretational outputs (e.g., part-of-speech tags), such models can produce semantically consistent predictions from complex inputs. However, how to train such models properly remains a highly contested issue. The two dominant paradigms for training such models are the maximum (conditional) likelihood estimation (MLE) [6], which leads to the well-known CRF, and the max-margin learning [9], [10], which leads to the MN. While both methods have enjoyed remarkable success and are widely used, they have a number of deficiencies, as we will discuss below. In this project, we introduce a new paradigm for learning structured I/O models, and graphical models in general, that conjoins and extends the merits of MLE and maxmargin learning while avoiding their shortcomings.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum Entropy Discrimination Markov Networks

Standard maximum margin structured prediction methods lack a straightforward probabilistic interpretation of the learning scheme and the prediction rule. Therefore its unique advantages such as dual sparseness and kernel tricks cannot be easily conjoined with the merits of a probabilistic model such as Bayesian regularization, model averaging, and ability to model hidden variables. In this pape...

متن کامل

Conditional and joint models for grapheme-to-phoneme conversion

In this work, we introduce several models for grapheme-tophoneme conversion: a conditional maximum entropy model, a joint maximum entropy n-gram model, and a joint maximum entropy n-gram model with syllabification. We examine the relative merits of conditional and joint models for this task, and find that joint models have many advantages. We show that the performance of our best model, the joi...

متن کامل

Partially Observed Maximum Entropy Discrimination Markov Networks

Learning graphical models with hidden variables can offer semantic insights to complex data and lead to salient structured predictors without relying on expensive, sometime unattainable fully annotated training data. While likelihood-based methods have been extensively explored, to our knowledge, learning structured prediction models with latent variables based on the max-margin principle remai...

متن کامل

The Integration of Dependency Relation Classification and Semantic Role Labeling Using Bilayer Maximum Entropy Markov Models

This paper describes a system to solve the joint learning of syntactic and semantic dependencies. An directed graphical model is put forward to integrate dependency relation classification and semantic role labeling. We present a bilayer directed graph to express probabilistic relationships between syntactic and semantic relations. Maximum Entropy Markov Models are implemented to estimate condi...

متن کامل

22 : Hilbert Space Embeddings of Distributions Lecturer : Eric

The application of classical optimization techniques to Graphical Models has led to specialized derivations of powerful paradigms such as the class of EM algorithms, variational inference, max-margin and maximum entropy learning. This view has also sustained a conceptual bridge between the research communities of Graphical Models, Statistical Physics and Numerical Optimization. The optimization...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Joint Maximum Margin and Maximum Entropy Learning of Graphical Models

نویسنده

چکیده

منابع مشابه

Maximum Entropy Discrimination Markov Networks

Conditional and joint models for grapheme-to-phoneme conversion

Partially Observed Maximum Entropy Discrimination Markov Networks

The Integration of Dependency Relation Classification and Semantic Role Labeling Using Bilayer Maximum Entropy Markov Models

22 : Hilbert Space Embeddings of Distributions Lecturer : Eric

عنوان ژورنال:

اشتراک گذاری